Optimizing a 3D Multigrid Algorithm for the IA-64 Architecture
نویسندگان
چکیده
Multigrid methods are known to be the numerically most efficient algorithms for solving large linear equation systems. Unfortunately a standard implementation runs with a, compared to the theoretical capabilities of modern processors, disappointing processor efficiency. While it is possible to achieve significant speedups with cache blocking techniques in 2D, 3D problems often don’t benefit from optimizations to the same degree. The IA-64 architecture is a revolutionary new architecture to overcome the problems and limitations of the established dynamically scheduled, speculative execution processors. One of the key concepts is to put large parts of the complexity on the software side, hence the compilers. We present results for an highly optimized multigrid solver in 3D, showing the potential of the IA-64 architecture. Because the compiler is not yet capable to exploit the full potential of the architecture for our type of algorithm the computational intensive parts are implemented in assembly.
منابع مشابه
Optimising a 3D multigrid algorithm for the IA-64 architecture
Multigrid methods are amongst the most efficient algorithms to numerically solve partial differential equations. However, standard implementations usually cannot exploit the potential of modern processors. The IA-64 architecture transferes most complexity to the software side to provide a highly superscalar design with large caches, leading to unique control over the actual execution. Exemplifi...
متن کاملGOLIAT: an Optimizing Linker for the IA-64 ArchiTecture
Modern compilers are extremely sophisticated and complex. This enables them to produce highly optimized code. However, due to the limited scope of compilers, the optimization opportunities that arise from analyzing and optimizing across module boundaries remain largely unexplored. We examined the viability of optimizing statically linked programs for the IA-64 architecture. As a result, the cod...
متن کاملEfficiency improvements of iterative numerical algorithms on modern architectures
For many numerical codes the transport of data from main memory to the registers is commonly considered to be the main limiting factor to achieve high performance on present micro architectures. This fact is referred to as the memory wall. A lot of research is targeting this point on different levels. This covers for example code transformations and architecture aware data structures to achieve...
متن کاملSoftSDV: A Presilicon Software Development Environment for the IA-64 Architecture
New instruction-set architectures (ISAs) live or die depending on how quickly they develop a large software base. This paper describes SoftSDV, a presilicon software-development environment that has enabled at least eight commercial operating systems and numerous large applications to be ported and tuned to IA-64, well in advance of ItaniumTM processor’s first silicon. IA-64 versions of Microso...
متن کاملUsing the SGI Pro64 Open Source Compiler Infra-Structure for Teaching and Research
Modern optimizing compilers are complex programs that require from tens to hundreds of people-years to be developed. Thus professors must use third-party compiler infra-structures to introduce students to compiler optimizations. Until recently only infra-structures developed at universities, research institutes, or by GNU were widely available for teaching. However, in May 2000, SGI made public...
متن کامل